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Abstract 



We aim to quantify the distribution of information in the Hawking radiation and inside the black 
hole in the semiclassical evaporation process. The structure of relativistic quantum field theory does not 
allow defining a localized entropy unambiguously, but rather forces to consider the shared information 
(mutual information) between two different regions of space-time. Using this tool, we first show that 
the entropy of a thermal gas at the Unruh temperature underestimates the actual amount of (shared) 
information present in a region of the Rindler space. Then, we analyze the mutual information between 
the black hole and the late time radiation region. A well known property of the entropy implies that this 
is monotonically increasing with time. This means that in the semiclassical picture it is not possible to 
recover the eventual purity of the initial state in the final Hawking radiation through subtle correlations 
established during the whole evaporation period, no matter the interactions present in the theory. We 
find extensivity of the entropy as a consequence of a reduction to a two dimensional conformal problem 
in a simple approximation. However, the extensivity of information in the radiation region in a full 
four dimensional calculation seems not to be guaranteed on general grounds. We also analyze the 
localization of shared information inside the black hole finding that a large amount of it is contained in 
a small approximately flat region of space-time near the point where the horizon begins. This gives place 
to large violations of the entropy bounds. We show that this problem is not eased by backscattering 
effects and argue that a breaking of conformal invariance is necessary to delocalize the entropy. Finally, 
£N) ■ we indicate that the mutual information could lead to a way to understand the Bekenstein-Hawking 

black hole entropy which does not require a drastic reduction in degrees of freedom in order to regulate 
the entanglement entropy. On the contrary, a large number of field degrees of freedom at high energies 
giving place to a Hagedorn transition implements a natural distance cutoff in the mutual information, 
which may in consequence turn out to be bounded. 

X. 

c5 ■ 1 Introduction 

Since the early days of quantum mechanics it was recognized that the relativistic analog of the non- 
relativistic quantum theory of a single particle contains inconsistencies. As it turned out, the root of the 
problem is in the concept of a localizable particle, which ceases to be meaningful in the relativistic case. 
In this later, the localization process inevitably leads to the creation of particle-antiparticle pairs, and 
requires the simultaneous description of an infinite number of degrees of freedom. This is achieved by the 
quantum field theory (QFT), which is the natural synthesis of quantum mechanics and special relativity 
allowing for a sharp space-time localization of events. 

This proliferation of degrees of freedom due to Lorentz invariance and locality has many important 
consequences. One of those is the presence of ambiguities when one attempts to define the entropy 
enclosed in a given volume. In order to define this entropy, consider a global state (density matrix) p. We 
can reduce it to the spatial set V by tracing out the degrees of freedom lying outside V 

p v = ti- V p. (1) 
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The density matrix py is the localized state relevant for computing expectation values of the observables 
acting in V . However, its von Newman entropy (which in the case of the vacuum state p = |0) (0| is called 
entanglement or geometric entropy) 

S(V) = -tvp v log p v , (2) 

is divergent [U [2] . This is due to the infinite amount of entanglement present in the vacuum fluctuations 
around the boundary of V. The trace leaves one member of a virtual pair as a "real particle" inside V 
while the other outside V is traced over. Due to their ultraviolet nature, the divergent terms appearing 
in the entanglement entropy of the vacuum state also show up in the entropy of any finite energy state 
reduced to the region V. 

In the text-book calculation of the entropy of a gas this problem is bypassed by imposing artificial 
boundary conditions on the region considered (i.e. a box with perfectly reflecting walls), and taking a 
state and a theory defined inside, avoiding in this manner the entanglement with the exterior region. 
Then, intensive quantities such as the entropy per unit volume are computed in the large volume limit. In 
this way one obtains results which are generally independent of the particular boundary conditions, since 
they contribute at most with a term increasing like the boundary area. Alternatively, one can get the 
same result for the mean entropy by imposing a cutoff to regularize the entropy of py and then letting the 
volume go to infinity before removing the cutoff. A theorem based on the strong subadditive property of 
the entropy guarantee the existence of this limit in translational invariant situations [3]. Therefore, in the 
context of special relativity, the localized entropy is tied to entropy density and to entropy extensivity. 

However, the ambiguities are unavoidable when the large volume limit cannot be taken. This is 
the case if one is interested in the entropy radiated by a black hole, or the one inside a cosmological 
horizon. Moreover, the existence of these types of horizons is inevitable when taking a large volume limit 
of homogeneous matter in the presence of the gravitational interaction, disregarding how weak this later 
is. This means that the black hole entropy problem is deeply related with the issue of state localization 
in relativity. 

Therefore, strictly speaking, in semiclassical gravity there is no well defined notion of localized entropy, 
not even in the sense which is allowed in special relativity. However, it can be argued that in some 
situations the high energy modes responsible for the entropy divergences decouple from the low energy 
ones. This may manifest itself in the presence of different terms (area and volume growing terms for 
example) which can be isolated and approximately separated. Thus, if this decoupling is efficient, one 
can achieve the large volume limit before gravity becomes relevant. Still, the decoupling is not evident at 
least for diluted gases where the contribution of the boundary to the low energy modes is of the order of 
the entropy one is interested in evaluating. This is certainly the case of the Hawking radiation, and may 
even be relevant to the radiation escaping from a star in the far away region. 

In relation with the scenarios of horizon formation, several entropy bounds have been proposed in the 
literature [HE]. These would forbid the entropy to grow faster than a constant times the boundary area 
of the region considered, giving a sharp expression to the problem of the entropy ambiguity. If a form of 
entropy bound holds in nature we certainly need to find new principles which guarantee the extensivity of 
the entropy. This may be part of the difficult task usually attributed to the holographic principle, which 
would be operative in quantum gravity [6]. 

However, we think this problem seems to require new ideas at low energies. Indeed, imagine we have 
come out with an expression for the entropy of a set V in flat empty Minkowski space. One may wonder 
what this entropy would actually do: describe the empty space seen by an inertial observer or rather 
the entropic radiation measured by an accelerated one. This phenomenon of observer dependence of the 
entropy has been pointed out in [7|. This suggests that the entropy ambiguity is somehow required by 
relativity and that it cannot be simply eliminated. In other words, it should not be possible to solve the 
localization problem just by producing a unique prescription to compute the entropy corresponding to a 
region in the classical limit of the space-time (for a different opinion see for example [8]). This would be 
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a new manifestation of the well known fact that a proper definition of entropy requires the specification 
of a coarse graining. 

In this paper we explore the question of entropy localization and extensivity at the semiclassical level. 
The entropy is then necessarily ambiguous. Instead of it we search for quantities which may replace the 
notion of localized entropy in QFT. We find that there is a natural and almost unique answer, which 
is given by the mutual information between two different non-intersecting regions. It has the meaning 
of the information shared by these two regions and it is regularization independent and well defined 
mathematically. In some sense it can be considered entropy in which a specific form of coarse graining 
has been imposed by the structure of the theory. We propose it as the correct tool to evaluate the entropy 
carried by the vacuum fluctuations. 

Two properties of the mutual information are relevant in this context: it is monotonically increasing 
with the size, and it gives a lower bound to the entropy on the regions involved. This later property was 
used in an attempt to properly define and test the entropy bounds at the semiclassical level [9]. Here, 
we study the mutual information in the Schwarzschild black hole evaporation problem. In a simple two 
dimensional approximation we find, somewhat surprisingly, extensivity for the mutual information on 
the radiation region. For the full four dimensional case the calculation is more involved, and we could 
not obtain a closed result. We point out however that the localization properties of the entropy in the 
Hawking radiation are far from clear, and in principle it can be distributed very differently than the 
radiated energy. 

We also study the localization of the entropy inside the black hole, and find that it is strongly concen- 
trated near the point where the horizon begins. We find that the amount of shared entropy there exceed 
the Bekenstein-Hawking entropy (or black hole entropy) Sbh = A/(4G). We discuss this result, and use 
the mutual information in trying to give a semiclassical statistical interpretation of the black hole entropy 
and the generalized second law. 

2 Mutual information 

The localized entropy has divergences which cannot be renormalized in the context of QFT. However, 
these have a nice geometrical structure which leads to a very specific way to subtract them away. 

For the sake of analyzing the ultraviolet divergent terms we can use a pure global state. Thus we 
have the symmetry property S(A) = S(—A), where —A is the set complementary to A on a global 
Cauchy surface. One can freely change the cutoff and make it point dependent (for example using a non 
homogeneous lattice), but this equality has to be maintained. This implies that the divergent terms, 
which being ultraviolet are local, must depend only on the boundary surface of A, which is shared with 
—A. They are also extensive on the boundary because the entropy adds for independent variables. On a 
more technical level, the entanglement entropy S(A) corresponding to the spatial set A is proportional to 
the variation of the Euclidean free energy with respect to the introduction of conical singularities at the 
boundary [10]. More precisely 



where Z(A, a) is the Euclidean partition function of the theory on a space with conical singularity of 
angle 2na at the boundary of A (for a = 1 there is no singularity). As a result, the ultraviolet divergent 
terms in the entropy S(A) are local on the set boundary, where the manifold is singular. In d spatial 
dimensions, we then have an expansion of the form [TT] (see also [12] ) 



where So (A) is a finite part, e is a short distance cutoff, and the gi are local and extensive functions on 
the boundary dA, which are homogeneous of degree i. The leading divergent term coefficient gd-i [9 A] 




(3) 



S(A) = g d _ x [8A] e-^ 1 ) +... + <& [dA] e' 1 + g [dA] log(e) + S {A) 



(4) 
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is proportional to the (d — 1) power of the size of A. This fact was recognized since the earliest papers 
on the subject and it was thought as an area law for the entanglement entropy [21 [13], which mimic and 
pointed to a common origin with the black hole entropy formula. However, for i > depends on the 
regularization procedure and ga-i is not proportional to the area if this later is not rotational invariant. 
This term is not physical within QFT since it is not related to continuum quantities (this divergence might 
be absorbed in the renormalization of G in quantum gravity [33]). The same can be said of the subleading 
divergent terms, excepting the logarithmic one, which has universal (cutoff independent) coefficient [1 lj . 

Thus, to produce a finite physical quantity out of S{A) we have to subtract the boundary contributions. 
As a bonus, this would automatically eliminate the problematic negative contact terms in the entropy 
which appear for theories containing gauge fields or non-minimally coupled scalars [15j . Two different 
ways to make this subtraction naturally suggest themselves. The first one consists in subtracting the 
entropies for two different global states, S(pf-) — S^p^) as done in [7J. The ultraviolet behavior should 
not depend on the state and must cancel in this combination. A variation of this is 



This particular combination is a well known quantity called the relative entropy S(p\\p2) between two 
states pi and p2- It has good properties which resemble the ones for the entropy. In particular S(pf \p% ) > 
0. It can be argued that this inequality for the particular case when pf is the reduced density matrix of 
the vacuum provides a version of the Bekenstein bound in flat space which is well defined and true. 

There are two different kinds of problems with this type of regularization. The first one occurs even 
for a fixed non dynamical metric. One uses here a reference state to define the local entropy of other state. 
This may be natural in flat space where one may subtract the entropy of the vacuum, but in the general 
case where there is no standard vacuum nothing dictates the election of a particular reference state. The 
second problem is that when the metric is dynamical two different states live in different geometries. This 
introduces an ambiguity in the region which has to be considered for each density matrix. To see how bad 
this uncertainty can be, consider a nearly flat case with an object of mass M enclosed in a region V of 
typical size Ry S> MG, and compare its entropy to the entanglement entropy of the vacuum in V. The 
uncertainty in the area produced by the Newtonian potential is 5 A ~ jlf^A. The corresponding entropy 
uncertainty considering a Planck scale cutoff is SS ~ SAG^ 1 ~ MRy, which has to be much smaller 
than the actual renormalized value of the entropy S obtained from the subtraction of the entanglement 
entropy of the vacuum. This gives S 3> MRy, which is just the opposite of the Bekenstein bound[J. This 
problem is eased if we take a cutoff distance larger than Planck scale, but still this type of subtractions 
do not lead to well defined localized entropy in the semiclassical case. 

Thus, we need a function of one state rather than two. To eliminate the divergences using only one 
global state we crucially take into account that they are extensive on the boundary. We then consider 
two different nonintersecting sets and compute |17] 



We pay the price that our entropy function depends now on two entries but this is a one state function 
and a regularization independent measure of entanglement. Note that the divergent boundary terms get 
subtracted in I (A, B). In consequence, finite terms in the function S(A) which are local on the boundary, 
like the area term, should be considered like gauges, since they do not modify the physical quantities like 
I (A, B). To define S(A U B) the two spatial sets A and B have to be spatially separated to each other in 
order that A U B is included in a Cauchy surface. In flat space, if the global state is the vacuum, I(A, B) 
is a characteristic property of the particular QFT theory. 

lr rhis is somehow a reverse argument to the one given by Bousso in [16], which proves the Bekenstein bound from the 
FMW bound [5]. 



S(pf\p£) = Trp^logpi - Trp^logp? . 



(5) 



I(A, B) = S{A) + S(B) - S(A UB) = S( PA ® Pb\paub) • 



(6) 
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Figure 1: The spatial surfaces A and A' correspond to the same causally complete region A (diamond 
shaped set), while B or B' are Cauchy surfaces for the causally complete region B. 



The locality of QFT leads naturally to the formula (|6|) in order to extract the universal part contained 
in the entropy. Curiously, the same quantity has independently been studied long before in statistics 
and information theory. It was named mutual information and has the meaning of a measure of the 
information shared between the two different systems (see for example |18j). 

For a pure global state we have S(A) = S(—A), and it follows from ([6]) that I(A, —A) = 2S(A). Thus, 
in QFT when B tends to cover the space outside of A, the mutual information I (A, B) tends to infinity, 
somehow reproducing the divergences which are present in S(A), but in a cutoff independent way. In 
fact, in general divergences appear in I(A, B) whenever the boundaries of the two sets touch each other. 

On the mathematical side, the mutual information can be defined rigorously in QFT when A and B 
are separated by a non zero distance (see however section 5 below). In the axiomatic approach there is 
no local Hilbert space of states attached to each region O, but rather a local algebra Ao of operators 
generated by the quantum fields. Then the mutual information can be defined using the Araki formula 
[19] for the relative entropy of a pair of states in an operator algebra (here paub and pa <8> Pb in the 
algebra Aaub [20])- 

Thus, when we write I(A, B) the sets A and B just identify the corresponding local operator algebras 
in the QFT. This gives a bonus. Due to causality, the algebras in QFT are generally attached to causally 
complete regions. Typical examples are diamond shaped regions as shown in Figure 1. This means that 



where A' and B' are any spatial sets which are Cauchy surfaces for the same D-dimensional set as A and 
B. In other terms, the mutual information is a function of the equivalence classes of spatial sets having 
the same causal completion, or what is the same, a function of causally complete regions. This forces 
us to think in space-time terms. The same requirement for the entropy would be that S(A) = S(A') 
for equivalent spatial sets A and A'. This can be guessed true from the unitarity of evolution, but the 
application of this relation may be hindered by the necessity of regularization. 

As an application of the mutual information, we may give an argument to the effect that the entan- 
glement entropy cannot be finite in QFT, independently of the details of the high energy sector. In |21j it 
was shown that if the entanglement entropy is finite and Lorentz invariant it must be exactly proportional 
to the boundary area. The proof is based on the strong subadditive property of the entropy, and involves 
crucially short and long distances of arbitrary size in Minkowski space. An entropy proportional to the 
area means that the mutual information is identically zero, I(A, B) = 0. This simply cannot hold since 
the mutual information is a low energy quantity which is non zero in QFT (see the examples below). The 
resolution of this tension in QFT implies divergent entanglement entropy. 

Next we review some mathematical properties of the mutual information which are appropriate to an 
information measure (for reviews of the properties of entropy and related quantities see for example |22j). 





(7) 
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We also show some explicit examples in flat space-time. In the following sections we begin to investigate 
to what extent it can be considered as a useful quantity which replaces the localized entropy for relativistic 
theories. 

2.1 General mathematical properties of I(A,B) 

In a general quantum system (not necessarily the local reduced density matrices in QFT), the function 
I (A, B) is symmetric, dimensionless and positive 

I{A,B)>0. (8) 

For product state (A and B statistically independent), paub = PA® Pb, the mutual information is zero 
I (A, B) = 0. It has the remarkable property of monotonicity with respect to size, 

I(A,B) <I(A,C), BCC. (9) 

This means that the mutual information is smoothly varying with the set, and that the sharpness in 
the definition of the boundaries is not very relevant to it. This property is a consequence of the strong 
subadditivity of the entropy [23J 

S(A) + S{B) > S(A U B) + S{A n B) , (10) 

but hold in the more general case in which S(A) cannot be defined, as in QFT [19] . We have also the 
general inequality 

I(A,B) < 2mm(S(A),S(B)). (11) 

We can define a form of tripartite information (somehow information shared between B and C with 
respect to A) 

I(A\B, C) = I{A, B) + I(A, C) - I(A, BuC). (12) 
Because of the definition ([6]) this function has complete permutation symmetry 

I(A, B, C) = I(A\B, C) = S(A) + S(B) + S(C) - S(A UB) — S{A U C) - S(B UC) + S(AU BuC) . (13) 

From (|12p we see that I(A, B, C) measures the degree of non-extensivity of the mutual information, in 
the same way as I(A, B) measures the degree of non-extensivity of S. However, I (A, B, C) can have both 
signs. It is zero when the total state is pure but the general condition which makes the mutual information 
extensive is unknown to the author. 



2.2 Some examples in Minkowski space 

The explicit calculation of the entanglement entropy and the mutual information involves a partition 
function in a non-trivial manifold which is generally difficult to evaluate. However, in some particular 
cases exact results can be obtained. Here we recall some known results with interest for the following 
sections. 

From the point of view of QFT the mutual information can be thought as a kind of correlator of set 
variables. One expects that for separating distances between A and B which are large with respect to 
their sizes I (A, B) decays exponentially fast for massive theories and as a power of distance for massless 
ones. For free theories it can be shown explicitly using the results of [24J and a lattice regularization that 
the mutual information decays as the square of the field correlator in the large distance limit and for the 
space-time dimension D > 2 [25]. For massless fermions that means 
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while for free massless scalars it is 

77/i »\ Gb(A,B) 

I(A,B)~ R2(D _ 2) . (15) 

The functions G(A, B) have the dimensions which make I(A, B) dimensionless, and are monotonously 
increasing with A and B. They depend on their relative orientation but are independent of their mutual 
distance R. For massive fields the power of R is replaced by an exponential. 

The opposite situation is one with two sets with boundaries very near to each other. The leading term 
can be inferred from dimensional analysis in this case. Take two sets A and B with parallel faces of area 
A, at a distance L much smaller than the other dimensions of the geometric arrangement. We then have 
[26] 

J(AB)=!«^5. (16) 
The dimensionless function k for free fields has been computed explicitly. For massive fields it is 



t 1 



D—2 poo poo 



"(*)= ~ - o- 2 m - I dy iy -^ dy 2 y°-*c[Jyl + yl) , (17) 

2 D - 3 7t— r(^) Jt Jo V v / 

where t = mL, and m is the fields mass. In the massless limit it reduces to a constant 

(D - 2) 2 D - 3 vr V r (^) ) dy y D ^ c(y) . (18) 

The function c(t) = (riBCB(t)+nFCF(t)), where n# and np are the multiplicity of the bosonic and fermionic 
degrees of freedom, and ce(i) and c_p(i) are the two dimensional entropic functions corresponding to boson 
and fermion fields |26|, I27j . These are defined by the formula c{mL) = L dS(L)/dL, where S(L) is the 
entanglement entropy corresponding to an interval of length L in 1 + 1 dimensions [17] . The subindex B 
corresponds to a two dimensional real scalar field and F to a Majorana fermion one. These functions are 
given in terms of solutions of Painleve V nonlinear ordinary differential equations [28J . 

The analytic expression for the entropy has also been obtained for conformal field theories in two 
dimensions, where the n-sheeted space can be conveniently transformed. In this case the entropy for a 
set formed by p disjoint spatial intervals is [29] 



S = ~T ( ^ logd ( ai ' 6 J') ~ ^2 l ogd(ai, aj ) - 5^1ogd(6j,6,-) -p loge 1 , (19) 



i,j i<j i<j 



where dj and bi are the left and right endpoints of the i th interval, e is the ultraviolet cutoff, Cy is the 
Virasoro central charge, and d is the distance function in Minkowski space. Remarkably, this specific form 
of the entropy gives mutual information which is extensive |27| . 



I(A,BUC) = I(A,B)+I(A,C), (20) 

or we could also say distributive, I(UiAi,UjBj) = J2ij I(Ai,Bj). This property does not hold in general 
for massive theories or massless ones in higher dimensions as shown by (|15p . For two intervals A and B 
on the same plane of sizes I a and Ib, separated by a distance lc, we have 

/(A B) = X { uiA + h + lc) ) = X ( ^ ' (21) 

which is explicitly cutoff independent and conformal invariant since it is expressed in terms of a cross ratio 
r) |17| . The entropy and the mutual information in the conformal case can be written in null coordinates, 
u = x — t and v = t + x, as a sum of two independent terms, one for each coordinate, representing the 
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independent contributions from the right and left moving chiralities. For example, for two intervals we 
have (not necessarily on the same spatial plane) 

I(A, B) = ^ i og ( (»a + uc)(uc + ub)\ + ?v log ( {va + vc)(vc + vb)\ (22) 
6 V u c {u A + u B + u c ) ) 6 V v c {va + v B + v c ) ) 

where ux and vx are the null coordinate sizes of X, lx = \Jux vx ■ While (fl~9P shows that the functional 
form of the entanglement entropy is the same (excepting a global constant) for all two dimensional 
conformal field theories, this is not the case for conformal theories in more dimensions (see (j!4j) and (|15p ). 



3 Rindler space 

As an introduction to mutual information in black hole backgrounds let us first focus on the simpler case 
of a Rindler space, where we can use the results of the previous Section. The entanglement entropy of 
the Rindler wedge is of course divergent, proportional to the logarithm of the cutoff in two dimensions, 
and to A/e D ~ 2 in D space-time dimensions, where A is the transversal area of the wedge. 

Then consider the mutual information between the Rindler wedge A and another set B exterior to it, 
as shown in figure 2. Let us take a conformal theory since in the case there is mass I(A, B) will rapidly 
go to zero with the mutual distance Ic- 

First take the space-time dimension D = 2 and B a segment of length Ib- The equation (|2ip for 
I A — ► oo gives 

I(A,B) = ^-Au, (23) 
where v = arcth(i/x) is a boost parameter, and 

A, = log (24) 

is the v interval corresponding to the set B. We can also write 

Av = aAr, (25) 

where r is the proper time measured by an accelerated observer with constant acceleration a. Thus, 
according to (|23p the mutual information is proportional to the diamond Rindler's time At times a 
constant "flux of information" ^-a due to a constant flux of Unruh radiation [IT]. This gives an interesting 
meaning to the surprising extensivity property (|20p for the mutual information of conformal fields. 

Let us explore in more detail this relation between mutual information and Unruh radiation. The 
Unruh temperature measured by an accelerated observer of constant acceleration a is T = a/(2ir) and 
this observer is at a constant distance 1/a from the wedge vertex. Thus, in general we have a temperature 
T = l/(27rx), varying with space-time point, where x is the distance to the wedge. In fact, the density 
matrix for the half space is thermal with respect to the boost operator K 

p ~ e~ 27vK (26) 

rather than the Hamiltonian, with a dimensionless temperature (2tt)^ 1 . On the other hand, in the large 
volume limit the entropy per unit volume for a conformal theory is [30] 



ax 6 

If we use (heuristically) this expression for computing the entropy of the radiation in B we have 



Ic Jlc 



3 2tcx 6 V Ic 
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Figure 2: The wedge A and diamond set B in the exterior region. The curves of constant v are lines 
through the origin. 

which is exactly one half of the mutual information. 

The factor two could be ascribed to the fact that when the global state pab is pure (that is when 
A U B tends to the whole space) the entropy appears duplicated in the mutual information, I (A, B) = 
S(A) + S(B) = 2S(A). However, the coincidence of the mutual information and twice the entropy 
heuristically contained in the thermal Unruh radiation is a peculiarity of the two dimensional conformal 
case, where the wedge can be transformed to a Euclidean cylinder [10]. In general there is no justification 
for using the large volume limit entropy density in the calculation above. 

On the other hand, for a theory in two dimensions, the proportionality of I (A, B) with Rindler's time 
can be understood in the large Av limit. In fact, in this limit we can even define an entropy density 
without resorting to the mutual information. This is because the contribution of the boundary of B to 
the entropy is constant, since it always consists of two points, while the Rindler time during which the 
radiation is collected grows indefinitely. Thus, the boundary contribution can be neglected for the mean 
entropy per unit v. More formally, the extensivity in this limit can be shown as a consequence of the boost 
symmetry. In order to see this we follow the argument which was used in [3] for probing the existence 
of a mean entropy in the large volume limit. There the authors considered a translational invariant one 
dimensional system, with S(l) the entropy of a length I interval. Using the SSA inequality ()10p between 
one segment and its translation by an infinitesimal amount one gets that S(l) is a (positive) concave 
function of I, S"(l) < 0. This automatically guarantee that S(l)/l approaches a constant for large I. In 
the present case the same argument can be used for S(B) (or S(A U B)) as a function of the B Rindler's 
time Av, where now the role of translations is played by the boost symmetry. Note we have to use the 
SSA inequality here between sets which are not on the same Cauchy surface but rather they are time-like 
displaced by a boost [21]. This is not an impediment for applying the SSA relation for one dimensional 
conformal theories since one can interchange the meaning of the temporal and the spatial coordinates. 
In this case the function I(A, B) is extensive in the large Av limit since both S(B) and S(A U B) are 
extensive there (S(A U B) has mean entropy per unit v equal to 0). It is remarkable that the formula 
(|23p shows a mutual information which is proportional to Rindler's time also for small Av. The physical 
meaning of this fact beyond the explicit calculation in [29] is not clear to the author. In the next section 
we will encounter again the two dimensional conformal mutual information (|2ip . but we will be mainly 
interested in the large Av region. 

Now let us turn attention to Rindler space in more dimensions. Let us compare the naive thermal 
entropy of the Unruh radiation inside a given set with the corresponding mutual information between 
the wedge and the set. The formula (|16|) gives the leading term for the mutual information between the 
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wedge and a rectangular set with a large parallel face of area A and short separating distance R. This 
gives 

I(A,B)^ Kj ^. (29) 

On the other hand, the integration on the rectangular set of twice the large volume limit entropy density 
of a free massless gas gives 

2 l% dV ~ k W^' (30) 



where for each field degree of freedom it is 



_ i)(D _ 2)22(^-2) vr ^ ii r (^1) A) exp(A)±l' 

The plus and minus signs are for fermions and bosons respectively. Comparing (j29jl and ()30p we see 
that the leading terms have the same functional form. However, in general k is greater than k and the 
difference increases with the space-time dimension D. The table 1 shows some examples. 

An interesting consequence of these results appear in connection to the discussions around the Beken- 
stein bound and the concept of unconstrained thermal radiation [31] . This state of matter is supposed to 
maximize the entropy for a fixed temperature distribution in space. Thus the entropy in unconstrained 
thermal matter in the Rindler space can not be given by the integrated entropy density of a thermal 
gas with the local Rindler temperature. Prom the table 1 we see that the mutual information is always 
greater than twice the thermal radiation entropy, and already more than three times (twice) greater in 
D = 4. According to the inequality (jlip this implies a corresponding inequality for the entropy, for any 
regularization which respects the correlations responsible for the mutual information. One could wonder if 
there is a deeper cause for this 'unnatural' relation between the mutual information and the large volume 
entropy density, and if it subsists in the presence of interactions. 

The reason for the differences with the two dimensional case are clear. In D > 2 we do not have any 
valid argument to relate the formula (|30|) with the physics of the problem. The large volume limit does not 
apply since the thermal wavelength T _1 = 2ttx is of the order of the distance x to the system boundaries. 
The argument for defining an unambiguous entropy of B using the large boost time limit does not work 
here since the entropy we want to measure S ~ A/R D ~ 2 is smaller than the cutoff contribution due to the 
boundary ~ A/e D ~ 2 . Also, the strong subadditivity cannot be applied between time-like separated sets 
in dimension D > 2, and there is no proportionality to the Rindler time. The right way to unambiguously 
measure the entropy is using the mutual information, which differs notably from the naive counting of 
gas entropy. 

Consider now the opposite limit for B, that is, when its size is much smaller than the separating 
distance to the wedge. We take B to be a sphere and keep the fields massless. For the conformal case the 
mutual information between two spheres must be a function of the cross ratio rj = ^n|^~j|~~j^ > where 



Ra and Rg are the sphere radius and I is their separating distance. Then using the eqs. (fT4j) and (|T5l) 
between two distant spheres and making a conformal transformation to map the sphere A to the wedge 
we get that I ~ {Rb/1) D2 for bosons and I ~ {Rb/1) D1 for fermions. We see the bosonic case has a 
different qualitative behavior than the integrated entropy density of a thermal gas, which has a functional 
form in this limit coinciding with the fermionic one. This recalls that, in contrast to the existence of an 
entropy density, in general we do not have extensivity for I(A,B). 

However, it is the two dimensional case what we intend to extrapolate to the black hole space-time 
in any dimensions. In the black hole analogy of the Rindler space Av must be interpreted as At/(4M), 
where t is the time as measured by the asymptotic observers and M is the black hole mass. Thus, I (A, B), 
where A is the black hole, would be proportional to the asymptotic time, corresponding to a constant flux 
of Hawking radiation [T7] . 
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D 


K 


k 


2 


1/3 


1/3 


3 


3.97 10- 2 


2.90 10~ 2 


4 


5.54 l(T a 


1.77 10~ a 


5 


1.31 10~ a 


1.69 10" 4 


6 


4.08 10~ 4 


2.01 10" 5 



D 


K 


k 


2 


1/6 


1/6 


3 


3.61 10- 2 


2.18 10" 2 


4 


5.38 10" 3 


1.55 10" 3 


5 


1.30 10~ a 


1.58 10" 4 


6 


4.06 10" 4 


1.95 10~ b 



Table 1: The coefficients k and k for different space-time dimensions D. The table on the left refers to 
bosons and the one on the right to fermions (one field degree of freedom). 



4 Evaporating black hole 

Is the semiclassical entropy of a black hole finite?. Both possible answers to this question can be encoun- 
tered in the literature, of course referring to different quantities. 

First consider the Cauchy surface Si in the figure 3(a). The space-time in this figure represents a 
black hole which forms and subsequently evaporates completely. Near the horizon of a large black hole 
the space-time is nearly flat, and the horizon resembles a Rindler horizon. If we trace over the degrees 
of freedom lying inside the black hole on Si, the entanglement entropy diverges with a leading term 
proportional to the area!!. 

On the other hand the arguments in [32} [33] reveal a reason why the entanglement entropy of a black 
hole must be finite. This is because the black hole has a fixed mass, which provides a cutoff to the 
fluctuations responsible to the entropy. In fact the total entropy radiated by a black hole is of the order of 
and greater than the Bekenstein entropy A/ (AG) [34], quite independently of the way the black hole was 
formed. If the global quantum state is pure this entropy should equal the entropy inside the black hole. In 
figure 3(a) this entropy corresponds to either the reduced state inside the whole black hole region (region 
I) or the state reduced to one of the Cauchy surfaces S2 for the final region, where the black hole has 
evaporated completely (this is the region of the space-time space-like separated to the whole black hole). 
It is very remarkable to find a proper region of the space-time which has finite entanglement entropy. One 
could be even tempted to define a black hole as a causal region with this property. Unfortunately, the 
exact nature of this entanglement entropy is outside the semiclassical analysis since it depends crucially 
on the details of the final point of evaporation, which gives the contact surface between the black hole and 
the region III. It is a necessary condition for finiteness that this point has zero area for example, what is 
not the case if there is a remnant or in the evaporation of charged black holes in certain theories where 
the final state is an extremal black hole. In any case, the entropy in region III should grow as M 2 , and 
even if it has a divergent part, it must be a constant independent of the initial black hole mass, and thus 
could in principle be isolated. 

The reason why the entropy in the whole Hawking radiation is finite while the near horizon fluctuations 
have infinite entanglement entropy is simply that most of these fluctuations fall back to the black hole 
and do not reach infinity [33]. They constitute what is called the thermal atmosphere which surrounds 
the horizon of the black hole. 

The definition of a continuous, time-dependent entropy, connecting the divergent entanglement in 
Si with the finite entanglement in S2 may require quantum gravity effects. These would regularize the 
divergent entanglement giving place to the sum of two terms for the Cauchy surface Si. One is the 
Bekenstein Hawking black hole entropy A/ (AG), which is a geometric quantity depending on the horizon 
surface, and should come from the regularized near horizon divergent terms in the entropy. The second 

2 One can be reassured that no global effect can change this divergent character by looking at the mutual information of 
two small sets on a nearly flat region on each side of the horizon. For short separating distance this mutual information 
increases without upper bound. The monotonicity of the mutual information shows that the conclusion is not changed for 
larger regions. 
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(a) 



(b) 



Figure 3: (a) Two different spatial surfaces Si and £2 on the space-time of a black hole which completely 
evaporates, (b) The set A is placed inside the black hole and B is in the radiation region. 

one would consist in the entropy contained far from the black hole in the Hawking radiation. According to 
the generalized second law this sum should always increase with time [35]. At the end of the evaporation, 
on £2, the only remaining entropy is in the radiation, which naturally has to be greater than A/ {AG) 
for a non equilibrium process. However, there is also the controversy (information loss paradox) over the 
possibility that quantum gravity effects could restore the purity of the final (exterior) state [36]. In this 
case the fine-grained entropy of the total radiation would be zero (for pure initial states). 

Therefore, at the semiclassical level there is no time dependent entropy, and at least no evident 
microscopic explanation to the Bekenstein-Hawking entropy (we will come back to this point in section 
V. See also [37] an references therein). Worse still, even if the total radiated entropy can be defined in 
the asymptotic future, in general there is no unique way to separate the entropy of the Hawking radiation 
in a surface like £1 and it is not possible to determine where this entropy is localized. The problem 
resembles the one encountered for the Rindler space, but the wavelengths of the relevant modes are here 
much smaller than the typical distances involved. On the other hand, the gas is much more diluted than 
in Rindler's case, making the entropy definition also problematic. This problem of entropy localization is 
also present for highly diluted states and is of a general nature. We can encounter the same difficulties in 
the radiation of a sufficiently cold object. In order to see this, consider an object of radius R, and let T 
be its surface temperature. Take a spherical shell of radiation of outer radius d which escaped during an 
interval of time At < d, much larger than the typical wavelength T _1 of the radiation, and smaller that 
the cooling time. Assuming the radiated entropy is extensive on time (or radius) we expect the entropy 
in the shell to be Ssheii ~ T^R 2 At < T 3 R 2 d, corresponding to an entropy flux in massless particles of 
order T 3 i? 2 . However, the regulated entanglement entropy of the shell has a contribution from the shell 
exterior surface which is proportional to the area in units of the cutoff squared. Choosing a distance 
cutoff of maximum size for the present problem, that is, of order of the radiation wavelength T _1 , the 
boundary contribution makes the entropy uncertain in 5S ~ d 2 T 2 . To extract a well defined localized 
entropy, it seems that we need d <C T R 2 . For a sufficiently large d this inequality cannot be maintained. 
For example it is violated for the sun radiation for d > 4 10 7 parsec and it is always violated for a black 
hole. One can still argue that due to the different behavior of the terms S s ^ e \\ and 5S with the size d, it 
could be possible to separate them. However, even in that case, the area term gives uncertainties to the 
linear term in d. This is due to the uncertainty in the definition of d which has to be of the order of our 
cutoff scale ~ T _1 . Thus, the naive iSgheii should be compared with the error in SS, of the order of dT. 
This gives (RT) 2 ^ 1 in order to extract an unambiguous local entropy. This is of course not the case 
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for the black holes which are very cold objects, having an inverse temperature bigger than their sizes. 

It is then of interest to explore the possibility that the energy flux (the energy momentum tensor) is 
not giving a good indicator of the entropy carried by the radiation. In the following we try to answer 
this question by studying the mutual information in a simple model and approximation for black hole 
evaporation. The mutual information gives a natural way to eliminate the area terms an divide what 
is nearby fluctuations from long range correlations. However, it requires specifying the other region to 
which the information is shared. Thus, a natural and well defined time-dependent entropy is given the 
mutual information between the black hole, or a part of it, and a region far away from the black hole, 
where the Hawking radiation is observed. The mutual information is also useful for determining where 
is located inside the black hole the information shared with the Hawking radiation. Thus, we consider 
I (A, B) where a typical placing of A and B is as shown in fig. 3(b). 

As mentioned, I(A,B) is increasing with A and B. Then it is bounded above (modulo contributions 
from stable remnants) by the mutual information between the whole black hole and the complete region 
III. If the initial state is pure this is just twice the total radiated entropy, XA/(4G), where A > 1 depends 
on the details of the theory. Also, the monotonicity property implies that the mutual information of 
the black hole and some subset of region III can only increase with the size of this later. Therefore it 
is not possible that correlations established during the whole evaporation period make the state in this 
region pure once some non zero entropy has developed at the early stages. Solutions of this type to the 
"information loss paradox" then would require quantum gravity. In particular, effects such as quantum 
fluctuations of the horizon position [38], or the failure of the Hilbert space in AuB to be the tensor product 
of the ones in A and B [39J, could invalidate this reasoning. But note that ordinary QFT interactions, 
and in particular interactions between the outgoing modes with the falling matter, cannot. 



4.1 Two dimensional approximation: extensivity in the radiation region 

To be concrete, in this Section we consider a spherically symmetric large black hole formed by the collapse 
of a thin null shell, which subsequently undergoes evaporation, as shown in the figure 4. We are interested 
in the relation between the black hole and the asymptotic radiation, and the issue of the final stages of 
evaporation will not be relevant here. 

Let us briefly describe the geometry [40J . The spherically symmetric metric after the shell passing can 
be written in the quasistationary regime of evaporation with an advanced time dependent mass m(v) 3> 1 
to a very good approximation [41] 

ds 2 = (l_ dy 2 _ 2dvdr _ r 2 {d Q2 + sin 2 (0 )#2) ^ (32) 

where v is a null coordinate and 47rr 2 measures the area of the two spheres. As the black hole evaporates 
the time dependent mass obeys the energy conservation equation 

*i (33) 
dv m(v) z 

Here k is a rate of evaporation which depends on the field content of the theory. Eq. (|32p can be written 
in terms of null coordinates 



ou 



dudv - r 2 (u,v)(de 2 + sin 2 (6)d<j) 2 ) , (34) 



where r(u,v) is given by the equations 

1 / 2m(v) 



Or 
dv 



2 



(35) 
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Figure 4: Penrose diagram for the black hole formation and evaporation. In the text the mutual infor- 
mation I(A, B) between A and B is calculated in a two dimensional approximation. 

and Jjjl = -5 on the asymptotically flat regions / + and I~. We take v s heii = 0. The region prior to the 
shell transit is flat and the metric can be written as 

ds 2 = dU dv - r 2 (U,v){d9 2 + sin 2 (6) d(p 2 ) , (36) 

with r(U, v) = (v — U)/2 with a different null coordinate U, and we have U = v on r = 0. The continuity 
of the metric along the shell v = gives the relation between null coordinates 

r(u,0)=~. (37) 

In order to find the relation between u and U we find the trajectory of a light ray near the horizon. 
In this region we expand in 5r(u,v) = r(u,v) — 2m{v) <C 2m(v). The solution for the outgoing radial 
trajectory u =constant to leading order in m(0) is exponentially increasing 

1 rv dv 

5r = 5r e 4h m W . (38) 

This exponential redshift holds roughly until the radial coordinate distance to the horizon reaches the 
size of the horizon radius Sr ~ 2m(v). After that point we can use the asymptotic relation r = (v — u)/2. 
This occurs at small r compared with both the u and v which interest for the future asymptotically flat 
region, 

u — v ~ 8m(v) <C v . (39) 
Thus, using (|37[) . we obtain the well known exponential redshift relation between null coordinates 

1 f u dv 1 

^U h -U) = --l — r - (m (uf- m(0 f). (40) 

m(0) 2 

This applies in the late time region, and well before the black hole reached Planck mass size, e sk < 
U h -U <€. 4ro(0). 

Now we proceed to compute the mutual information in this geometry. We will use the two dimensional 
approximation to black hole evaporation |42j . This consists in replacing the (massless) fields in the four 
geometry by a conformal field in the two dimensional geometry given by the radial-temporal part of the 
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spherically symmetric metric. To mimic the four dimensional case a reflecting boundary condition for 
the two dimensional field is imposed at r = 0. The justification of this approximation is that the most 
important contribution to the Hawking radiation comes just from a few field modes, specially the one 
with the lowest angular momentum. However, the conformal field in a conformally flat two dimensional 
metric neglects the backscattering by the geometry. Still, the effective dimensional reduction gives a good 
qualitative understanding of the evaporation process and of the character of information transmission as a 
one dimensional channel [33]. For the entanglement entropy it is not expected to give a reasonable result, 
since it neglects the large angular momentum modes which lead to the divergent term proportional to the 
area/e 2 , with e a distance cutoff (one massless field gives only a logarithmic divergence ~ loge). However, 
we still expect it to give a reasonable qualitative approximation to the mutual information between the 
black hole and a distant region, since this is mainly due to the correlations between the black hole and 
future infinity, which are approximated by the two dimensional field. The analysis which follows is closely 
related to the discussions in [44J and |45j, which we hope it completes in certain aspects. In particular, our 
use of the mutual information justifies the claim in [45J that the two dimensional entanglement entropy 
is the relevant one for the higher dimensional case. 

The Hawking radiation is then determined by the two dimensional conformal anomaly, and the mass 
loss rate coefficient is given by |46j 

< 41 > 

with Cy the Virasoro central charge of the conformal theory (Cy = 1 for a massless scalar). 

Due to the propagation along null directions in a conformal two dimensional theory the local algebras 
of field operators can be thought attached to regions of the null axes. In the present case, due to the 
reflecting boundary conditions at the origin, the two null propagating directions are mapped into one 
[44j . A causal set in space-time has two past null shadows, one which goes directly to the past infinity 
I~ and the other goes to the origin first, and then is reflected to reach I~ (see figure 4). Thus to each 
single space-time interval it generally corresponds two disjoint segments at I~ and the operator algebra 
corresponding to this subset of the v axes. Since the state at I~ is the Minkowski vacuum we can use 
formulas (|20p and (122p (with only one chirality contributing) to evaluate the mutual information. In 
the example of figure 4, the quantity I(A, B) can be decomposed into four different terms of mutual 
information between single intervals as 

I(A, B) = I{A U B X ) + I(A 1 ,B 2 ) + T(A 2 , B x ) + I{A 2 , B 2 ) . (42) 

Under the conditions of the validity of the two dimensional approximation, which requires observing 
the Hawking radiation far from the black hole, the contribution from I(Ai,B 2 ) and I(A 2 ,B 2 ) can be 
neglected. Also, the size of B\ and its v coordinate distance from vh are exponentially small, which in 
general allows to neglect I(A 2 , B\). In order to have a non negligible mutual information, A\ should 
be also exponentially near vh- Let us then consider the case where A contains a piece of the horizon, 
^3 = V H- This gives 

Cy 

I(A, B) = — {\og{v H - vi) + log(w 4 - v 2 ) - \og{v H - v 2 ) - log(i> 4 - v x )) 

D 

~ 16 7T (m( Ul ) 2 - m{u 2 ) 2 )) = A(ui) - A{u 2 ) , (43) 

up to contributions logarithmic in the black hole area A. Here u\ and u 2 are the asymptotic times of 
B, corresponding to U\ = v\ and U 2 = v 2 through (|40p . We have assumed that the width of A is not 

m(0) 2 

exponentially small, V4 — vh ^> e . 

Thus, the mutual information between the black hole and a region between two asymptotic times is 
proportional to the differences of black hole areas in Plank units at the corresponding times. This is 
four times the Bekenstein entropy. As we mentioned, one factor two comes from the definition of mutual 
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information, which measures twice the entropy if the state in A U B is pure. The other is due to the fact 
that in two dimensions the total radiated entropy in the conformal case is exactly twice the Bekenstein 
value (see for example [35]). 

Surprisingly, the mutual information turns out to be extensive on the radiation region, being dis- 
tributed as dictated by the entropy density of a thermal gas with the appropriate energy flux. If this 
continue to hold in the full four dimensional calculation, it would reinforce the idea that it is the correct 
measure of localized entropy in relativity. However, the extensivity in the two dimensional problem may 
not give a full account of the subtle information sharing in the higher dimensional space-time. In particu- 
lar, even if the correlations between A and B are approximately taken into account, the field correlations 
in this model inside A or B are not realistic, and this may significantly change the results. Also, the 
angular distribution of information is not accessible to the two dimensional problem. Thus, we cannot 
give a definite answer to the interesting question of information extensivity here. We hope to come back 
to this question on a future work. In any case, we remark that, contrary to the entropy, we do not know 
of any principle which implies spatial extensivity for the mutual information. 

The explicit expression for I(A, B) also allows us to find the location inside the black hole of the 
information shared with the Hawking radiation. This is contained in any exponentially thin AU > 
g-m(o) /(8fc)^ nu rj s j ie ]2 ) exponentially near to the horizon, \v^ — Vh\ ^ e~ m ^ /( 8fc ). No appreciable change 
in the mutual information is derived by including deeper regions in the black hole. Also the shell can be 
exponentially short in Av without appreciably modifying the outcome. In particular we could take A to 
be only an exponentially small neighborhood of the point H where the horizon begins (see figure 4), well 
inside the flat space region. The mutual information of this neighborhood with all the hawking radiation 
would be of order A/G. On the other hand eq. (jlip implies that the entropy in A is bounded below by 
I(A, B)/2. This means that the entropy bounds get violated by a very large factor near H (see also the 
discussion in [9]). 

One could argue that this is no surprising since the entropy diverges semiclassically. However, the 
mutual information is finite and cutoff independent, and one can expect it to be stable against Plank 
scale physics in the macroscopic domain. The sets A and B can be taken macroscopic and separated by a 
very large distance to each other. Thus, if the entropy bounds hold we would like to know the mechanism 
which makes the semiclassical calculation above not reliable. With this aim the role of backscattering is 
explored in the next section. Also, the interactions may impede the mode extrapolation to the H point, 
though the modes are at an extremely high frequency in this region and one should be capable to rely on 
the geometric optics there. 

A final commentary to this section. The mutual information forces us to think in four dimensional 
terms [17] , and this brings up some common misunderstandings related to the semiclassical description of 
black hole evaporation. First, the time evolution is perfectly unitary for the whole space-time, and there 
is no loss of information. The entropy seen for asymptotic observers is due to information sharing with the 
region inside the black hole. Also, sometimes it is stated that the information responsible for the entropy 
is destroyed at the black hole singularity. We think this is misleading, since for the external observer the 
relevant information is eternally kept in the black hole interior, and even, in this dimensionally reduced 
calculation at least, it is retained in a small place of flat space in the region before the black hole formation. 
The black hole then acts as form of remnant trapped in a different time (the presence or not of a real 
stable remnant is then mostly irrelevant in this sense). Third, it is sometimes argued that the entropy in 
the radiation evaporated from a black hole, since it depends on its whole past history, can be arbitrarily 
large, independently of the present size of the black hole. If the initial state is pure this entropy should 
then be the same as the black hole one, being inconsistent with its value being ^4/(4G). However, this 
again depends on what is taken as the entropy of the black hole. The mutual information shows that the 
correlations responsible of the Hawking radiation correspond to much earlier times, and if the black hole 
entropy is related to a regularization of the surface divergent terms [48] (and not to all the entropy in the 
black hole region), there is no contradiction for it to take the Bekenstein-Hawking value. 
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4.2 Localization inside the black hole: including backscattering 

In this section we include the backscattering by the Schwarzschild geometry of the four dimensional 
problem for free massless fields. We focus attention on the distribution of information inside the black 
hole, and our primary interest is to check if a large amount of information is still contained near the 
neighborhood of H when backscattering effects are taken into account. 

We use the quasistationary approximation which reduces the problem to one of a scattering in the 
Schwarzschild geometry of mass M. To introduce the notation consider first a massless scalar field. We 
decompose it into modes of definite angular momentum I and frequency w 

u=^-e- iwt Y lm (9,c/>). (44) 
r 

The radial function satisfies the equation 

JL + W 2- V (r)), (45) 

with r* = r + 2M log(^=f^) and the potential 

, , / 2M\ ( 1(1 + 1) 2M\ , . 

v <r) = { 1 -—) + —) ■ ( 46 ) 

The scattering on the potential barrier, which vanishes for r* — > ±oo, determines the evolution of the 
original Unruh vacuum state. The radial function corresponding to the so called up modes behaves as 

fF(T*,w) ~ e iwr * +n(w)e- iwr * for r*^-oo, 

fr(r*,w) ~ ti{w)e iwr * for r* -» oo , (47) 

where ri(w) and ti(w) are the amplitudes for reflection and transmission through the barrier, with 
\fl (w) | 2 + |t; (if) | = 1. The modes relevant for the Hawking radiation for late time observers are very high 
frequency when propagated back to the flat region. They are entangled with the very high frequency int 
modes which can be described by wave packets of exponentially small size entering the black hole in the 
region near H. The relevant contribution to the Unruh vacuum state (which is the relevant one for black 
hole evaporation), in the subspace of definite energy and angular momentum, is 



|0)=n y/l - e" 8 * e-* nMNw \N, up) (8) \N, int) , (48) 



N=0 



where N is the occupation number and some wave packet discretization and normalization in the modes 
is assumed (for details see for example |42j). 

Now, according to (|47p . the up modes split into out modes, which go to infinity, and the down modes, 
which fall through the horizon (at much later times than the int modes) . Accordingly, the up modes write 

N , v i 

|iV,up)=£( j ti{w) p n(w) N - p \p, out) (8 |JV - p, down) . (49) 
p=0 ^ ' 

The relevant Hilbert space for the radiation gets then factorized as TL = Tl mT 'Si'H OUT (^Ti. DOWN . Then, 
one can compute the reduced density matrices (for each mode of definite w and I, we suppress these 
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indices for notational convenience) 



<x> 



p mT = (l-x)^2x N \N, int) (N, int| , (50) 
N=0 

Pout = (1 ~ x) ^ ) ^ |iV, out) (AT, out| , (51) 

« „ (i - (i -r)x) 



AT=0 



Pdown 



" x ) E 7TTF V+i 1 ^ down) (iV ' down| ' (52) 



with x = e - SnMw and F = \t[(w)\ 2 is the transmission coefficient. 

These formulas are also valid for fields having non zero helicity, where F is obtained from the adequate 
differential equation [39). Also, similar expressions hold for the fermionic case. We can treat both cases, 
fermionic and bosonic, in a unified way by defining the functions 

s±(x, r) = ± log(l ± x) — (Fx logCTx) ± (1 ± (1 - T)x) log(l ± (1 - T)x)) , (53) 

1 ± x 

where the plus (minus) sign corresponds to fermions (bosons). Then the entropies per mode are given by 

S mT (w,l) = s±(x,l), (54) 
S OVT (w,l) = s±(x,F), (55) 
S DOW n(w,1) = s±(x,l-r). (56) 

When these entropies are summed over the angular modes both, 5 INT and S UOWN , diverge. The S'qut 
however is regulated due to the sharp fall of transmission coefficient with /. Thus the total radiated 
entropy 'per unit time' can be computed 



dS c 



dt 27T 

l=o 



- V(2/ + l) / dws ± (x,T). 
]7T t^ Jo 



(57) 



We have raised doubts on the interpretation of this quantity as an entropy density, but when integrated 
over time with a slowly varying temperature and T which incorporate the change in the black hole mass 
during the evaporation, it must give an excellent adiabatic approximation to the total entropy radiated. 

In consequence, to explore the distribution of information inside the black hole we cannot rely on Sint 
or Sdown- We can use instead the mutual informations between the asymptotic future / + , with Hilbert 
space Wout; with the near H region, with Hilbert space TCi NT , and the rest of the black hole, which we 
call H = H - N, with Hilbert space Wqqwn- The expressions of the mutual information per mode are 

I(^,I + )(w,l) = S mT (w,l) + S OVT (w,l) - S DOWN (w,l) , (58) 
I(H,I + )(w,l) = S DOWN (w,l) + S OVT (w,l) - S mT (w,l) , (59) 
I(H,I + )(w,l) = 2S OVT (w,l). (60) 

Now, we can use the expression analogous to ([57|) to compute the full mutual information functions 

dt—J2( 2l + 1 ) dwl(w,l). (61) 
71 i=o ^° 

This mode by mode calculation is possible because of two reasons. First, we are considering the whole 
asymptotic region I + , so no problem with the boundaries of a finite region can arise. Also we are 
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h 


5*out/ Sbh 


J(N, J+)/(2S B h) 


I(H,I+)/(2S BB ) 





1.88 


1.49 


0.40 


1/2 


1.64 


1.35 


0.29 


1 


1.50 


1.27 


0.23 


2 


1.35 


1.19 


0.16 



Table 2: Numerical results on the mutual information between the asymptotic future I + (containing the 
Hawking radiation) and two regions in the interior of the black hole, the neighborhood of the point H 
where the horizon begins, and the rest of the horizon H (for later times). The results for the evaporation 
using different massless field helicity h are normalized to twice the Bekenstein entropy Sbh- The sum 
of the last two columns equals the first one. This later gives the total entropy radiated and reproduces 
results in [53] . 



considering a fixed H region, whose actual size, smaller than the black hole radius and bigger than 
exponentially small distances, is not relevant. This means that if correlations between H and H change 
their mutual informations with the distant / + region they do it by a fixed quantity, much smaller than 
the black hole entropy. 

The expressions (I58H60|) can be compared with the black hole Bekenstein entropy loss due to loss of 
energy 

^ 4 Mg (2i + 1 )f^. ,62) 

Comparing the mode contributions we always have 

S OU t > Sbh (63) 

as corresponds to the generalized second law |34j . This is true for an arbitrary T function and just 
expresses that the out of equilibrium radiation process from a thermal reservoir must increase the total 
entropy. 

Remarkably, the total amount of mutual information with the initial aleph point is kept finite, for an 
exponential small set This large violation of the entropy bounds is then not eased by backscattering. 
Moreover, we have 

>2S BH . (64) 

This is true for fields of any helicity, since 

s±(x, T) + s±(x, 1) - s±(x, 1 - r)) > 
rs|log(x)|/(l±x) - 

for any x £ [0, 1] and T £ [0, 1]. We will discuss the thermodynamical explanation for this as well as for 
([63]) in Section V. 

Note that since the global state is pure we have the partial extensivity 

I(H, 1+) = I(H, 1+) + 1+) = 2S OVT . (66) 

This and the fact that s-t(a;,r) is increasing with T lead to 

5 OU t</(H,/ + )<25 OU t. (67) 

Some numerical results for radiation by massless fields of different helicities in four dimensions are 
given in table II. The numerical methods are discussed in [50]. For the two dimensional approximation 
T = 1 and we have 5 OU t / Sbh = 2, /(H, / + )/ (2Sbh) = 2, as we found in the previous section. For a massive 
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field with mass m 3> (MG) , we have T ~ #(4Mu> — I) and this gives almost reversible evaporation |51j . 
S OVT ~ Sbh, but with an exponentially suppressed flux, 

dt it 2 y J 

In this case, and in any dimension, we have almost all the mutual information concentrated in the N 
region, J(N, /+) ~ 2S B h- 



4.3 Localization inside the black hole and conformal symmetry 

Given a pure state |0) in a bipartite system TL = TL\ ® TL2 we can choose basis in order to express it in 
the Schmidt decomposition [18] 

|0) =J2 X i\ a i)®\&) > ( 69 ) 

with ^ |Aj| 2 = 1, in a similar fashion as eq. (|48p . Then there is a one parameter unitary symmetry acting 
on the global Hilbert space given by 

U(s) = p? s ® p™ , (70) 

where 

Pi=^2\\i\ 2 \ai)®{ai\, (71) 
P2 = ^2\\i\ 2 \Pi)®(Pi\ , (72) 
are the reduced density matrices. The symmetry then leaves the state |0) invariant 

U(s) |0) = |0> • (73) 

Thus, in quantum field theory any decomposition of the space in two regions leads to a one parameter 
group of symmetries which leaves the vacuum invariant. This is one of the results of the Tomita-Takesaki 
theory [52] (modulo details of the mathematical definitions, since strictly speaking the reduced density 
matrices do not exist in QFT). This unitary group is called the modular group and s acts as a kind of 
internal time for the two regions. For any QFT, the Rindler decomposition of the Minkowski space in 
two half spaces leads to a modular group whose the action is given by the boost symmetries {53] (see 
(|26p ). In general however, the modular group does not have a simple action in terms of space-time point 
transformation on the field operators. 

Now, suppose that we have a further decomposition Ti\ = TLa 8) TLa an d Ti-2 = TLb ® Hg an d 
we are interested in the mutual information I(A,B). An element of the modular group will keep the 
decomposition TL = Ti\ invariant, but will carry these new decompositions into different ones by the 
unitary transformation Ti\ = TLa' ® H-ft an d TC2 = TLb 1 ® As the global state is kept the same we 
have 

I(A,B) = I(A',B'). (74) 

We want to use this property with the modular group corresponding to the space-time of figure 5, 
representing a black hole formed by the collapse of a null shell. One of the two sets in which it is 
decomposed the space-time corresponds to the diamond shaped set in flat space bounded by the horizon 
on the lower side and the incoming null shell on the upper one. As the state is the Minkowski vacuum 
the action of the modular group in this region is a flat space problem (it depends on p\). In general the 
action of the modular transformations is not known, an exception being the case of conformal theories, 
in which the diamond can be transformed to the Rindler wedge. We have that the modular group in this 
case acts point transformation given by [54] 

X' ± = 2M ((2M + X±) - e 27rs (2M - X±)) ((2M + X±) + e 2ns (2M - Xi))^ 1 , (75) 
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Figure 5: Black hole formed by the collapse of a null shell. The region I below the shell and above the 
horizon is a diamond shaped subset (intersection of the past of a point and the future of another point) 
of Minkowski space. Region II contains all points space-like separated from region I. The arrows show 
the trajectories of the modular group. It acts geometrically in region I for conformal fields and in the late 
time region II if thermal equilibrium is achieved. 



where X + = t — r + 2M and X- = r + t + 2M are null coordinates, and 2M is the radius of the diamond 
(Schwarzschild radius). The origin of time was set such that v = r + 1 = for the incoming shell. Thus, 
we have chosen X + = X- = 2M for the upper vertex and X + = X_ = —2M for the lower diamond vertex 
(which we have called N). Thus, any set A included in this region will be carried to N to an exponentially 
fast velocity for large s 

5X' ± = X' ± + 2M~4M & ~ 2nS ■ (?6) 

This means due to (|74j) that any mutual information of a set in the diamond will also be achieved for sets 
as near H as we want. 

The modular group would in general not act geometrically in the region opposite to the diamond 
(region II in figure V). However, if we can make the black hole to come to a thermal equilibrium at a 
temperature T with the radiation in the distant future (or the radiation in thermal equilibrium alone), 
by imposing boundary conditions or using an asymptotically AdS space, then the modular group will 
act as time translations there. This is because since the reduced density matrix is thermal p2 ~ e~ H ^ T 
the modular group is U(s) = e~ iHs / T . In this case, the mutual information for the exponentially small 
(~ e~ 2ns ) and exponentially near N set A' with the region B' time translated by an amount At = s/T 
is the same as I (A, B) for any value of the parameter s as large as we want. Note that this relation 
somehow encodes the Hawking effect. If in the black hole model of Section III we keep the mass constant 
m(v) = M by providing the necessary incoming radiation, this is exactly the relation we find from the 
one between coordinates corresponding to (|40p 



log(U h -[/) = --— = --— = -27rs , (77) 
4M ATM ' y J 



where we have used the black hole temperature T = (8ttM) . 
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5 A finite semiclassical Bekenstein entropy and the generalized second 
law 



We now comment on an interesting phenomena which was pointed out in [17] (see also [2CHI55]) and which 
we interpret here as a possibility to understand the Bekenstein-Hawking entropy at the semiclassical level. 
This, contrary to the usual assumptions about quantum gravity, does not require a reduction of degrees of 
freedom, or a cutoff, at high energies, but exactly the contrary, a large number of field degrees of freedom. 

In order to proceed consider two sets with large parallel faces of area A at a distance which is larger than 
the inverse of the free field mass, ML 3> 1. The formula (|17|) gives a mutual information exponentially 
decreasing with L 

T(A m~ A ( ML )^ r ~2ML / 7 ox 

I(A,B)^ —-^—e , (78) 

which is the same for each free bosonic and fermionic degree of freedom. This follows from the asymptotic 
form of the entropic c-functions c(t) ~ l/4(ng + np)tKi(2t) for large i > 1, with K\ the modified 
Bessel function |26|, 127] . Suppose now that we have a series of independent massive fields and call the 
density of fields per unit mass p{M). If this density is exponentially increasing at high values of the mass 
p(M) ~ f(M)e M / TH , it is known that the theory acquires a Hagedorn highest possible temperature Th- 
This also implies that the mutual information 

I(A, B)^A 2-( d+2 )l-^tt-^ J" d M f(M) e~ M ^ L ~ T ^ (79) 

diverges when the sets are at a finite distance to each other smaller than dn = T^ 1 /2 (rather than when 
they are in contact). Remarkably, at the critical distance the mutual information can still be finite. This 
depends on whether the integral of f(M) is convergent. If the energy density for the Hagedorn 

temperature is finite then the mutual information for critical distance is also finite. Thus, the mutual 
information may incorporate a natural distance cutoff dn which effectively makes I (A, B) bounded for a 
fixed A (and B in the same spatial plane), giving place to a form of entropy bound. For B at distances 
smaller than dn, strictly speaking, the mutual information cannot be defined [20] (there is no product 
Hilbert space Ha <8> Hb). 

We have argued above that the mutual information can represent the Hawking radiation entropy 
unambiguously. If there is a Hagedorn temperature, it is possible that it could incorporate also the black 
hole entropy. Let us think again in the Cauchy surface Si in figure 3(a). We take as A the region of the 
black hole interior on this surface and the second set B the subset of the exterior which makes I(A, B) 
maximal. This incorporates a distance cutoff to the horizon in a way which resembles the stretched 
horizon or the brick wall models [TJ [56]. This Cauchy surface dependent mutual information Js, then 
has a term proportional to the horizon area due to the tower of massive degrees of freedom, and also 
the contribution of the massless fields due to the Hawking radiation. Note that (I68p implies that the 
contribution of the massive modes to the Hawking radiation is negligible for black holes larger than the 
Hagedorn temperature. As a bonus, this interpretation of the black hole entropy has the advantage of 
not requiring any discreteness in the space-time nor in the fundamental degrees of freedom. 

This proposal would have to be completed in two different directions. In first place there should be 
a connection between the Hagedorn transition and gravity in order that the coefficient of the area in the 
mutual information at the critical distance to be equal to (2G) _1 . We can only mention in this sense the 
works |57] which are also related to the connections between black holes and a Hagedorn transition. 

As a second requirement one would like to be capable to explain the generalized second law (GSL). 
The existent versions of proofs of the GSL at the semiclassical level obviate that the entropy diverges and 
cannot be well defined. The Sorkin's proof [58] rely on the fact that the time evolution outside the black 
hole is autonomous. This interesting proof cannot be directly translated to our candidate for the total 
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entropy (black hole plus radiation) as half the maximum mutual information at least in an obvious 
manner, since this later also depends on the black hole interior (see [32] for a related discussion). 

On the other hand, the content of the Frolov and Page proof for the quasistationary case is very 
similar to what is required here [59J. We recall their work. In the quasistationary regime they map the 
problem to one of a scattering in an eternal black hole. The two different times corresponding to Si and 
E2 at which one should compare the entropies are taken to be the asymptotic past and future. One then 
makes the Cauchy surface partition Si = H~ U I~ and S2 = H + U / + , with H + and H~ the future and 
past horizons respectively. Then, they argue that the initial state should be taken as uncorrelated, 

PS! = Ph- ® Pi- , (80) 

a key assumption which usually makes the second law work. For the final state we have by the unitary 
evolution S(T,2) = <S*(Si) = S(H~) + S(I~), and by subaditivity of the entropy 

S(H+) + S(I + ) >5(S 2 ). (81) 

Then we have 

S(H + ) + S(I + ) > S(H~) + S(r). (82) 

Now, one introduces elements from the quasistationary nature of the evaporation. The emitted state pu- 
is taken thermal at the black hole temperature T, and the Hilbert space on H + and H~ are assumed to 
be equivalent, mapped to each other by the CPT operator O of the eternal black hole geometry, which 
commutes with the Hamiltonian. This implies that we can use the positivity of S{pu- \®ph+@^) to show 
that the free energy is a minimum for the thermal state pjj- 

^ F -S(H-)<^-S(H + ), (83) 

where E + and E~ are the incoming and outgoing energies respectively. Assuming the usual thermody- 
namical equilibrium relations hold between entropy, energy and temperature of the black hole and the 
quasistationary approximation gives 

AS BH = E+ ~ E ~ . (84) 

Then, the combination of the inequalities (|82p and (|83p give the GSL, 

S I+ - Sj- + AS BH > . (85) 

Note that this demonstration makes no particular use of the specific expression of the black hole entropy, 
and only rely on its character of thermal reservoir. This fact underlies the general validity of the inequality 

§3} M- 

In terms of the mutual information the equation (|80p reads I(H~,I~) = 0, and the eq. (|8ip is 
I(H+,I+) > 0. Then (Jg2J) means 

I(H + ,I + )>I(H-,r), (86) 

which can be taken as a quasistationary idealization for to the proposal here Js 2 > This equation 
just expresses the development of correlations between the system and the hidden sector (the black hole 
here), which is the mechanism of the second law. 

The results of the previous section suggest a different formulation of the second law. This requires 
interpreting I(A, B) as the sum of the black hole plus exterior entropy, but where now A is kept fixed 
as a neighborhood of the H point, and B is again the subset of a Cauchy surface S exterior to the black 
hole and which maximizes the mutual information. This formulation has the advantage of making explicit 
a difference between past and future, which is present in the second law. Presumably, the contribution 
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from the massive modes would also be proportional to the black hole area on £ in this case. In the 
quasistationary scheme of Frolov and Page we have to take as H the hidden sector to which the thermal 
state pn- is entangled, or, more generally, a sector which allows the purification of the total state, that 
is, such that pj-uff-uN i s pure. Thus we have £(N U I~) = S(H~) and, assuming that the time evolution 
does not mixes the N sector with the rest, 5(NU / + ) = S(H + ). Using this and the inequalities (|83p and 
(j85|) the GSL in this new formulation follows 



where, as above ASbh = AE/T takes its quasistationary value (in the full version it would be replaced 
by the contribution of the massive modes to and Again, this also holds in a more 

general context, since we only have taken into account the black hole as a thermal reservoir which is 
initially uncorrelated to the exterior system. This is behind the general validity of (|64p and (|65|). 

6 Final comments 

It is generally believed that in the physical applications of the concept of entropy it must not be taken 
as an absolute quantity, but it has to be understood with reference to some form of coarse graining. In 
practice however, this later is rarely spelled out. It is therefore satisfying that the combination of relativity 
and quantum mechanics might impose one such specific form of coarse graining, only allowing to talk of 
the entropy enclosed in a region while in relation with another one, in the form of shared information. 

Indeed, the mutual information seems to be the right tool to deal with the vacuum fluctuations which 
plague of divergences the localized entropy in QFT. This is specially relevant for dilute entropy scenarios 
such as the Hawking radiation escaping an evaporating black hole. The calculations done so far in this 
paper shed only a partial light on this issue and we have certainly asked more questions than obtained 
answers. 

One important point to be further explored is the extensivity of I (A, B) in the radiation region, which is 
suggested by the distribution of energy, but does not seem to follow from a simple derivation. In this sense, 
it would also be relevant to find out the conditions implying the extensivity of the mutual information, 
I(A,B,C) = 0, in a general quantum system. In particular, we do not have a simple understanding of 
the exact extensivity in conformal theories. One may wonder if there is a deeper connection between 
extensivity of the mutual information in the radiation region in the full higher dimensional problem and 
two dimensional conformal theory. 

If the mutual information can replace the notion of entropy, the extensivity may also be relevant in 
a different direction. Suppose an entropic object falls to the black hole. In order to include its entropy 
in the scheme one should express it as mutual information with a hidden sector A 1 (which can also be 
another black hole or a cosmological horizon). Then, it is natural to expect that the mutual information 
of the exterior region B with the black hole A plus the hidden sector, I {A U A', B), separates as a sum 
I(A, B) + I(A' , B). One can assume that A and A 1 are uncorrelated between themselves, but the reason 
why the correlations of these degrees of freedom with a localized B would have additive information is 
not clear. 

Another subject of interest concerns the stability properties of I(A, B) in QFT against changes in the 
high energy sector of the theory. This is relevant regarding the information loss paradox. If information is 
retrieved in the Hawking radiation one must understand through which mechanism the indications of this 
paper fail: monotonicity in the radiation region and a large information gathering in the neighborhood of 



We think it is also very important to understand the black hole entropy at the semiclassical level. 
This is especially relevant since there has not been a systematic investigation up to present as to whether 
quantum gravity is a physically decidable theory. For example, according to one school, effects of quantum 




(87) 
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gravity lead to the information recovery in the Hawking radiation after complete black hole evaporation. 
However, even if this may hold in a particular model, it would not be possible to check it experimentally. 
This is not due to practical reasons, but because of fundamental ones, since gravitons are inevitably 
emitted, and there is not enough matter to fully measure them in an asymptotically flat (or de Sitter) 
space [60] , 
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